Draft October 2003 3 From recent overviews of annotated
نویسندگان
چکیده
This chapter describes the broad phonemic transcription in the CGN. First a broad overview of phonetic annotations in Dutch corpora is provided and a number of crucial dimensions are discussed: the source of annotation (human or automatic), the type of material involved, the level of transcription and the symbol set and transcription conventions. These dimensions serve as a guide through a number of aspects of the broad phonetic transcription in the CGN. In section 2 the level of transcription is discussed: methodological as well as fairly practical considerations are elaborated on with respect to the detail of phonetic annotations (or transcriptions) as well as the required level of expertise of the transcribers. In section 3 a pilot study is summarized that was meant to address issues such as the source of annotation and the transcription task (transcription ‘from scratch’ or verification of an automatically generated transcription) relative to practical matters such as the estimated transcription time, expected errors and variability. The next sections deal with the protocol: in section 4 the CGN set of phonetic symbols is defined and in section 5 the manual provided for the transcribers is described. The actual transcription procedure is dealt with in section 6: the entire corpus received an automatically generated broad phonetic transcription, ten percent of which, i.e. the ‘core corpus’, is manually verified. In section five the details of the grapheme-to-phoneme conversion of the orthographic annotation is described, as well as the manual verification procedure. The final section of this chapter is a first attempt to assess the quality of the manually verified transcriptions.
منابع مشابه
Braving the broadcast storm: infrastructural support for ad hoc routing
Several routing algorithms for mobile ad hoc networks have been proposed in the recent past [Broch et al., The Dynamic Source Routing Protocol for Mobile Ad Hoc Networks, Internet Draft draft-ietf-manet-dsr-03.txt, October 1999; Perkins et al., Ad Hoc On Demand Distance Vector (AODV) Routing, Internet Draft draft-ietf-manet-aodv04.txt, October 1999; Haas and Pearlman, The Zone Routing Protocol ...
متن کاملDraft Genome Sequences of Saccharibacter sp. Strains 3.A.1 and M18 Isolated from Honey and a Honey Bee (Apis mellifera) Stomach
The annotated draft genome sequences of two recent Saccharibacter sp. strains isolated from honey and a honey bee stomach in 2014 are reported here. Currently, two Saccharibacter whole-genome sequences are available in databases; thus, the sequences of our new isolates will contribute to a better understanding of Saccharibacter genomes.
متن کاملSelected Annotated Bibliography to accompany: Some Foundations in Complex Systems: Tools and Concepts
2 General Complex Systems 2 2.1 Textbooks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2 2.2 Popular Books . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 2.3 Overviews, Reviews, and Commentary . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 2.4 Mathematical Modeling and Computational Methods . . . . . . . ...
متن کاملTowards Automatic Web Service Composition using AI Planning Techniques (first draft)
This article discusses how artificial intelligence (AI) planning techniques can be used to enable automatic composition of Web Services. Particulary, the paper discusses how standard Web Service descriptions can be annotated and converted into proper formats like PDDL to enable reasoning with modern AI planning tools.
متن کاملTrends in Marital Stability*
Recent reports about the stability of marriages appear to yield conflicting conclusions. We reconcile these estimates, showing that data from several sources uniformly point to increasing marital stability among those married since the mid-late 1970s. This draft: October 30, 2007 First draft: September 30, 2007
متن کامل